SBS 2016 Track Mining: Classification with Linguistic Features for Book Search Requests Classification
Abstract
In this paper, we describe text mining approaches dedicated to the classification track of the Social Book Search Lab 2016. This track aims to exploit social knowledge extracted from LibraryThing and Reddit collections to identify which threads on online forums are book search requests. Our proposed classification model is based on a combination of different textual features, namely: (i) basic linguistic features such as nouns and verbs; and (ii) composed features such as term sequences and generated noun phrases. We then apply a Naive Bayes classifier to identify the user's intentions in the requests.
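The general idea of classifying forum threads from term-sequence features with Naive Bayes can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' system: the abstract gives no implementation details, so the whitespace tokenizer, the use of word unigrams and bigrams as a stand-in for "term sequences", the add-one smoothing, and the toy training threads are all hypothetical. The part-of-speech features (nouns, verbs, noun phrases) are left out because they would require an external tagger.

```python
import math
from collections import Counter, defaultdict


def extract_features(text, max_n=2):
    """Return lowercased word n-grams (a stand-in for term sequences)."""
    tokens = [t for t in text.lower().split() if t.isalpha()]
    feats = []
    for n in range(1, max_n + 1):
        for i in range(len(tokens) - n + 1):
            feats.append(" ".join(tokens[i:i + n]))
    return feats


class MultinomialNaiveBayes:
    """Multinomial Naive Bayes with add-one (Laplace) smoothing."""

    def fit(self, texts, labels):
        self.class_counts = Counter(labels)
        self.feature_counts = defaultdict(Counter)
        self.vocab = set()
        for text, label in zip(texts, labels):
            feats = extract_features(text)
            self.feature_counts[label].update(feats)
            self.vocab.update(feats)
        return self

    def predict(self, text):
        total_docs = sum(self.class_counts.values())
        vocab_size = len(self.vocab)
        best_label, best_score = None, -math.inf
        for label, doc_count in self.class_counts.items():
            # Log prior of the class.
            score = math.log(doc_count / total_docs)
            total = sum(self.feature_counts[label].values())
            for feat in extract_features(text):
                if feat not in self.vocab:
                    continue  # ignore features never seen in training
                count = self.feature_counts[label][feat]
                score += math.log((count + 1) / (total + vocab_size))
            if score > best_score:
                best_label, best_score = label, score
        return best_label


# Hypothetical toy threads, invented purely for illustration.
train_texts = [
    "can anyone recommend a good fantasy book",
    "looking for a book similar to dune",
    "please suggest a mystery novel for my trip",
    "i just finished reading this novel yesterday",
    "the new cover design looks great",
    "our group meeting is moved to friday",
]
train_labels = ["request", "request", "request", "other", "other", "other"]

clf = MultinomialNaiveBayes().fit(train_texts, train_labels)
print(clf.predict("can you recommend a book about dragons"))
print(clf.predict("the meeting is moved"))
```

In a fuller version, the n-gram extractor could be replaced or supplemented by features restricted to nouns, verbs, and noun phrases, which is closer to what the abstract describes.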
Related resources
Overview of the SBS 2016 Mining Track
In this paper we present an overview of the mining track in the Social Book Search (SBS) lab 2016. The mining track addressed two tasks: (1) classifying forum posts as book search requests, and (2) linking book title mentions in forum posts to unique book IDs in a database. Both tasks are important steps in the process of solving complex search tasks within online reader communities. We prepare...
KNOW At The Social Book Search Lab 2016 Mining Track
This paper describes our system for the mining task of the Social Book Search Lab in 2016. The track consisted of two tasks: the classification of book request postings and the linking of book identifiers with references mentioned within the text. For the classification task we used text mining features like n-grams and vocabulary size, but also included advanced features like average spell...
Verbose Query Reduction by Learning to Rank for Social Book Search Track
In this paper, we describe our participation in the INEX 2016 Social Book Search Suggestion Track (SBS). We have exploited machine learning techniques to rank query terms and assign an appropriate weight to each one before applying a probabilistic information retrieval model (BM15). Thereafter, only the top-k terms are used in the matching model. Several features are used to describe each term,...
Linking Task: Identifying Authors and Book Titles in Verbose Queries
In this paper, we present our contribution to the INEX 2016 Social Book Search Track. This year, we participate in a new track called the Mining track. This track focuses on detecting and linking book titles in online book discussion forums. We propose a supervised approach based on a Support Vector Machine (SVM) classification process combined with Conditional Random Fields (CRF) to detect book titles. ...
Query Expansion by Word Embedding in the Suggestion Track of CLEF 2016 Social Book Search Lab
The Social Book Search (SBS) Lab is part of the CLEF 2016 lab series. This is the fourth time that the CYUT CSIE team has taken part in the SBS track. The organizers changed the content of the topics slightly; therefore, we made the necessary modifications to our system, which is based on keyword searching and ranking by social features. This year, we design a query expansion module which is based on word2v...